NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MQuAKE-Remastered: Multi-Hop Knowledge Editing Can Only Be Advanced with Reliable Evaluations

Zhong, Shaochen; Lu, Yifan; Shao, Lize; Bhushanam, Bhargav; Du, Xiaocong; Wan, Yixin; Shi, Yucheng; Zha, Daochen; Wang, Yiwei; Liu, Ninghao; et al (April 2025, The Thirteenth International Conference on Learning Representations (ICLR), April 24-28, 2025, Singapore.)

Free, publicly-accessible full text available April 25, 2026
Customized FinGPT Search Agents Using Foundation Models

https://doi.org/10.1145/3677052.3698637

Tian, Felix; Byadgi, Ajay; Kim, Daniel S; Zha, Daochen; White, Matt; Xiao, Kairong; Liu, Xiao-Yang (November 2024, ACM)

Full Text Available
MQUAKE-REMASTERED: MULTI-HOP KNOWLEDGE EDITING CAN ONLY BE ADVANCED WITH RELIABLE EVALUATIONS

Zhong, Shaochem; Lu, Yifan; Shao, Lize; Bhushanam, Bhargav; Du, Xiaocong; Wan, Yixin; Shi, Yucheng; Zha, Daochen; Wang, Yiwei; Liu, Ninghao; et al (January 2025, The Thirteenth International Conference on Learning Representations)

Free, publicly-accessible full text available January 22, 2026
Towards Automated Imbalanced Learning with Deep Hierarchical Reinforcement Learning

https://doi.org/10.1145/3511808.3557474

Zha, Daochen; Lai, Kwei-Herng; Tan, Qiaoyu; Ding, Sirui; Zou, Na; Hu, Xia Ben (October 2022, 31st ACM International Conference on Information & Knowledge Management)

Full Text Available
In-Processing Modeling Techniques for Machine Learning Fairness: A Survey

https://doi.org/10.1145/3551390

Wan, Mingyang; Zha, Daochen; Liu, Ninghao; Zou, Na (January 2022, ACM Transactions on Knowledge Discovery from Data)

Machine learning models are becoming pervasive in high-stakes applications. Despite their clear benefits in terms of performance, the models could show discrimination against minority groups and result in fairness issues in a decision-making process, leading to severe negative impacts on the individuals and the society. In recent years, various techniques have been developed to mitigate the unfairness for machine learning models. Among them, in-processing methods have drawn increasing attention from the community, where fairness is directly taken into consideration during model design to induce intrinsically fair models and fundamentally mitigate fairness issues in outputs and representations. In this survey, we review the current progress of in-processing fairness mitigation techniques. Based on where the fairness is achieved in the model, we categorize them into explicit and implicit methods, where the former directly incorporates fairness metrics in training objectives, and the latter focuses on refining latent representation learning. Finally, we conclude the survey with a discussion of the research challenges in this community to motivate future exploration.
more » « less
Full Text Available
Dual Policy Distillation

https://doi.org/10.24963/ijcai.2020/435

Lai, Kwei-Herng; Zha, Daochen; Li, Yuening; Hu, Xia (July 2020, Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence)

Policy distillation, which transfers a teacher policy to a student policy has achieved great success in challenging tasks of deep reinforcement learning. This teacher-student framework requires a well-trained teacher model which is computationally expensive. Moreover, the performance of the student model could be limited by the teacher model if the teacher model is not optimal. In the light of collaborative learning, we study the feasibility of involving joint intellectual efforts from diverse perspectives of student models. In this work, we introduce dual policy distillation (DPD), a student-student framework in which two learners operate on the same environment to explore different perspectives of the environment and extract knowledge from each other to enhance their learning. The key challenge in developing this dual learning framework is to identify the beneficial knowledge from the peer learner for contemporary learning-based reinforcement learning algorithms, since it is unclear whether the knowledge distilled from an imperfect and noisy peer learner would be helpful. To address the challenge, we theoretically justify that distilling knowledge from a peer learner will lead to policy improvement and propose a disadvantageous distillation strategy based on the theoretical results. The conducted experiments on several continuous control tasks show that the proposed framework achieves superior performance with a learning-based agent and function approximation without the use of expensive teacher models.
more » « less
Full Text Available

Search for: All records